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We develop a geometric framework to study the structure and function of complex networks. We 
assume that hyperbolic geometry underlies these networks, and we show that with this assumption, 
heterogeneous degree distributions and strong clustering in complex networks emerge naturally 
as simple reflections of the negative curvature and metric property of the underlying hyperbolic 
geometry. Conversely, we show that if a network has some metric structure, and if the network degree 
distribution is heterogeneous, then the network has an effective hyperbolic geometry underneath. 
We then establish a mapping between our geometric framework and statistical mechanics of complex 
networks. This mapping interprets edges in a network as non-interacting fermions whose energies are 
hyperbolic distances between nodes, while the auxiliary fields coupled to edges are linear functions of 
these energies or distances. The geometric network ensemble subsumes the standard configuration 
model and classical random graphs as two limiting cases with degenerate geometric structures. 
Finally, we show that targeted transport processes without global topology knowledge, made possible 
by our geometric framework, are maximally efficient, according to all efficiency measures, in networks 
with strongest heterogeneity and clustering, and that this efficiency is remarkably robust with respect 
to even catastrophic disturbances and damages to the network structure. 

PACS numbers: 89.75.Hc; 02.40.-k; 67.85.Lm; 89.75.Fb 



I. INTRODUCTION 

Geometry has a proven history of success, helping to 
make impressive advances in diverse fields of science, 
when a geometric fabric underlying a complex problem or 
phenomenon is identified. Examples can be found every- 
where. Perhaps the most famous one is general relativ- 
ity, interpreting gravitation as a curved geometry. Quite 
a contrasting example comes from the complexity the- 
ory in computer science, where apparently intractable 
computational problems suddenly find near optimal so- 
lutions as soon as a geometric underpinning of the prob- 
lem is discovered [5], leading to viable practical applica- 
tions [3]. Yet another example is the recent conjecture 
by Palmer P] suggesting that many "mysteries" of quan- 
tum mechanics can be resolved by the assumption that 
a hidden fractal geometry underlies the universe. 

Inspired by these observations, and following [5], we 
develop here a geometric framework to study the struc- 
ture and function of complex networks [!]• We be- 
gin with the assumption that hyperbolic geometry un- 
derlies these networks. Although difficult to visualize, 
hyperbolic geometry, briefly reviewed in Section |lTj is 
by no means anything exotic. In fact it is the geom- 
etry of the world we live in. Indeed, the relativistic 
Minkowski spacetime is hyperbolic, and so is the anti-de 
Sitter space ^-TO" . On the other hand, hyperbolic spaces 
can be thought of as smooth versions of trees abstracting 
the hierarchical organization of complex networks 
a key observation providing a high-level rationale. Sec- 



tion pTlfor our hyperbolic hidden space assumption. In 
Section |IV| we show that from this assumption, two com- 
mon properties of complex network topologies emerge 
naturally. Namely, heterogeneous degree distributions 
and strong clustering appear, in the simplest possible 
settings, as natural reflections of the basic properties of 
underlying hyperbolic geometry. The exponent of the 
power-law degree distribution, for example, turns out to 
be a function of the hyperbolic space curvature. Fortu- 
nately, unlike in [4], for instance, we can directly verify 
our assumption. In Section [V] we consider the converse 
problem, and show that if a network has some metric 
structure — tests for its presence are described in [T^] — 
and if the network's degree distribution is heterogeneous, 
then the network does have an effective hyperbolic geom- 
etry underneath. 

Many different pieces start coming together in Sec- 
tion |VI[ where we show that the ensembles of networks 
in our framework can be analyzed using standard tools 
in statistical mechanics. Hyperbolic distances between 
nodes appear as energies of corresponding edges dis- 
tributed according to Fermi-Dirac statistics. In this in- 
terpretation, auxiliary flelds, which have been consid- 
ered as opaque variables in the standard exponential 
graph formalism |13H17) . turn out to be linear func- 
tions of underlying distances between nodes. The chem- 
ical potential, Boltzmann constant, etc., also find their 
lucid geometric interpretations, while temperature ap- 
pears as a natural parameter controlling clustering in 
the network. The network ensemble exhibits a phase 
transition at a specific value of temperature, caused — 
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FIG. 1: (Color online) Poincare disk model. In (a), -Li,2,3 and Pi, 2, 3 are examples of hyperbolic lines. Lines -I/i,2,3 intersect 
to form triangle ABC. The sum of its angles a + b + c < n. As opposed to Euclidean geometry, there are infinitely many 
lines (examples are Pi, 2, 3) that are parallel to line Li and go through a point C that does not belong to Li. In (b), a 
{7, 3}-tessellation of the hyperbolic plane by equilateral triangles, and the dual {3, 7}-tessellation by regular heptagons are 
shown. All triangles and heptagons are of the same hyperbolic size but the size of their Euclidean representations exponentially 
decreases as a function of the distance from the center, while their number exponentially increases. In (c), the exponentially 
increasing number of men illustrates the exponential expansion of hyperbolic space. The Poincare tool [1] is used to construct 
a {7, 7}-tessellation of the hyperbolic plane, rendering a fragment of The Vitruvian Man by Leonardo da Vinci. 



as usual — by a non-analyticity of the partition function. 
This phase transition separates two regimes in the en- 
semble, cold and hot. Complex networks belong to the 
cold regime, while in the hot regime, the standard config- 
uration model [18 and classical random graphs [19 turn 
out to be two limiting cases with degenerate geometric 
structures, Section|lXj Sections [VII| and [VIII| analyze the 
degree distribution and clustering as functions of temper- 
ature in the two regimes. 

Finally, in Section |Xj we shift our attention to network 
function. Specifically, we analyze the network efficiency 
with respect to targeted communication or transport pro- 
cesses without global topology knowledge, made possi- 
ble by our geometric approach. We find that such pro- 
cesses in networks with strong heterogeneity and cluster- 
ing, guided by the underlying hyperbolic space, achieve 
the best possible efficiency according to all measures, and 
that this efficiency is remarkably robust with respect to 
even catastrophic levels of network damage. This finding 
demonstrates that complex networks have the optimal 
structure, allowing for routing with minimal overhead 
approaching its theoretical lower bounds, a notoriously 
difficult longstanding problem in routing theory, proven 
unsolvable for general graphs [SU] . 

II. HYPERBOLIC GEOMETRY 

In this section we review the basic facts about hyper- 
bolic geometry. More detailed accounts can be found 
in [HM]. 

There are only three types of isotropic spaces: Eu- 



clidean (flat), spherical (positively curved), and hyper- 
bolic (negatively curved). Hyperbolic spaces of constant 
curvature are difficult to envisage because they cannot be 
isometrically embedded into any Euclidean space. The 
reason is, informally, that the former are "larger" and 
have more "space" than the latter. 

Because of the fundamental difficulties in represent- 
ing spaces of constant negative curvature as subsets of 
Euclidean spaces, there are not one but many equivalent 
models of hyperbolic spaces. Each model emphasizes dif- 
ferent aspects of hyperbolic geometry, but no model si- 
multaneously represents all of its properties. In special 
relativity, for example, the hyperboloid model is com- 
monly used, where the hyperbolic space is represented 
by a hyperboloid. Its two different projections to disks 
orthogonal to the main axis of the hyperboloid yield 
the Klein and Poincare unit disk models. In the lat- 
ter model, the whole infinite hyperbolic plane H^, i.e., 
the two-dimensional hyperbolic space of constant curva- 
ture — 1, is represented by the interior of the Euclidean 
disk of radius 1, see Fig. [T] The boundary of the disk, 
i.e., the circle S^, is not a part of the hyperbolic plane, 
but represents its infinitely remote points, called bound- 
ary at infinity dM^ . Any symmetry transformation on 
translates to a symmetry on (911^, and vice versa., a 
cornerstone of the anti-de Sitter space/conformal field 
theory correspondence tSiilOj , where quantum gravity on 
an anti-de Sitter space is equivalent to a quantum field 
theory without gravity on the conformal boundary of the 
space. Hyperbolic geodesic lines in the Poincare model, 
i.e., shortest paths between two points at the boundary, 
are disk diameters and arcs of Euclidean circles intersect- 
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ing the boundary perpendicularly. The model is confor- 
mal, meaning that Euclidean angles between hyperbolic 
lines in the model are equal to their hyperbolic values, 
which is not true with respect to distances or areas. Eu- 
clidean and hyperbolic distances, and r/j, from the disk 
center, or the origin of the hyperbolic plane, are related 
by 

re = tanhY. (1) 

The model is generalizable for any dimension d ^ 2, in 
which case is represented by the interior of the unit 
ball whose boundary S''"^ is the boundary at infinity 
dMf^. The model is related via the stereographic pro- 
jection to another popular model — the upper half-space 
model — where is represented by a "half" of K"^ span 
by vectors x = {xi,X2, ■ ■ ■ , Xd) with Xd > 0. The bound- 
ary at infinity dW^ in this case is the hyperplane Xd = 
instead of S''^^. Essentially any d-dimensional space X 
with a (d — l)-dimensional boundary can be equipped 
with a hyperbolic metric structure, with the X's bound- 
ary playing the role of the boundary at infinity dX. 

Given the abundance of hyperbolic space representa- 
tions, we are free to choose any of those, e.g., the one 
most convenient for our purposes. Unless mentioned oth- 
erwise, we use the native representation in the rest of the 
paper. In this representation, all distance variables have 
their true hyperbolic values. In polar coordinates, for 
example, the radial coordinate r of a point is equal to 
its hyperbolic distance from the origin. That is, instead 
of 0, we have 

r = rh = re. (2) 

A key property of hyperbolic spaces is that they ex- 
pand faster than Euclidean spaces. Specifically, while 
Euclidean spaces expand polynomially, hyperbolic spaces 
expand exponentially. In the two-dimensional hyperbolic 
space of constant curvature K = —(^ < 0, C > 0, for 
example, the length of the circle and the area of the disk 
of hyperbolic radius r are 

L{r) = 27rsinhCr, (3) 
A{r) = 27r(coshCr - 1), (4) 

both growing as e^'' with r. The hyperbolic distance x 
between two points at polar coordinates (r, 9) and (r', 9') 
is given by the hyperbolic law of cosines 

cosh (x = cosh Cr cosh (r' — sinh sinh (r' cos A9, (5) 

where A9 = n — \tt — \9 — 9'\\ is the angle between the 
points. Equations (|3][5| converge to their familiar Eu- 
clidean analogs at C 0. For sufficiently large (r, (r' , 
and A9 > 2\/e~^^'' -I- e"-^^''', the hyperbolic distance x is 
closely approximated by 

, 2, . A6' , 2A9 

x — r + r + - in sm — r + r + - m — . (6) 



TABLE I: Characteristic properties of Euclidean, spherical, 
and hyperbolic geometries. Parallel lines is the number of 
lines that are parallel to a line and that go through a point 
not belonging to this line, and C, = \/| A"! . 
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Curvature K 
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thick 
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Shape of triangles 
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Sum of A angles 


TT 


> TT 


< TT 


Circle length 


27rr 


27r sin (^r 


2n sinh C,r 


Disk area 


27rrV2 


27r(l — cos C,r) 


27r(cosh C,r — 1) 



That is, the distance between two points is approximately 
the sum of their radial coordinates, minus some A9- 
dependent correction, which goes to zero at C — J' oo. 

Hyperbolic spaces are similar to trees. In a 6-ary tree 
(a tree with branching factor 6), the analogies of the circle 
length or disk area are the number of nodes at distance 
exactly r or not more than r hops from the root. These 
numbers are {b+l)h'-'^ and [(6 + 1)6'' - 2]/(6 - 1), both 
growing as V with r. We thus see that the metric struc- 
tures of and 6-ary trees are the same if C = In h: in 
both cases circle lengths and disk areas grow as e^''. In 
other words, from the purely metric perspective, Hj^nf, 
and 6-ary trees are equivalent. Informally, trees can 
therefore be thought of as "discrete hyperbolic spaces." 
Formally, trees, even infinite ones, allow nearly isometric 
embeddings into hyperbolic spaces. For example, any tes- 
sellation of the hyperbolic plane (see Fig.[T]) naturally de- 
fines isometric embeddings for a class of trees formed by 
certain subsets of polygon sides. For comparison, trees do 
not generally embed into Euclidean spaces. Informally, 
trees need an exponential amount of space for branching, 
and only hyperbolic geometry has it. 

Table H] collects these and other characteristic proper- 
ties of hyperbolic geometry and juxtaposes them against 
the corresponding properties of Euclidean and spherical 
geometries. 



III. TOPOLOGICAL HETEROGENEITY 
VERSUS GEOMETRICAL HYPERBOLICITY 

In this section we make high-level observations suggest- 
ing the existence of intrinsic connections between hyper- 
bolic geometry and the topology of complex networks. 

Complex networks connect distinguishable, heteroge- 
neous elements abstracted as nodes. Understood broadly, 
this heterogeneity implies that there is at least some tax- 
onomy of elements, meaning that all nodes can be some- 
how classified. In most general settings, this classification 
implies that nodes can be split in large groups consist- 
ing of smaller subgroups, which in turn consist of even 
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smaller subsubgroups, and so on. The relationships be- 
tween such groups and subgroups can be approximated 
by tree-like structures, sometimes called dendrograms, 
which represent hidden hierarchies in networks [Tl . But 
as discussed in the previous section, the metric structures 
of trees and hyperbolic spaces are the same. We empha- 
size that we do not assume that the node classification 
hierarchy among a particular dimension is strictly a tree, 
but that it is approximately a tree. As soon as it is at 
least approximately a tree, it is negatively curved |27j . 
This argument obviously applies only to a snapshot of a 
network taken at some moment of time. A logical ques- 
tion is how these taxonomies emerge. Clearly, when a 
network begins to form, the node classification is degen- 
erate, but as more and more nodes join the network and 
evolve in it, they tend to diversify and specialize, thus 
deepening their classification hierarchy. The distance be- 
tween nodes in such hierarchies is then a rough approxi- 
mation of node similarity, and the more similar a pair of 
nodes, the more likely they are connected. 

We consider several examples suggesting that these 
general observations apply to different real networks. So- 
cial networks form the most straightforward class of ex- 
amples, where network community structures |281 129] 
represent hidden hierarchies 30J . More concretely, in pa- 
per citation networks, the underlying geometries can ap- 
proximately be the relationships between scientific sub- 
ject categories, and the closer the subjects of two papers, 
the more similar they are, and the more likely they cite 
each other |31j |32j • Classifications of web pages (or more 
specifically, of the Wikipedia pages [33l [S^) also show 
the same effect: the more similar a pair of web pages, the 
more likely that there is a hyperlink between them |35j . 
In biology, the distance between two species on the phy- 
logenetic tree is a widely used measure of similarity be- 
tween the species [3B]. Note that this example empha- 
sizes both the existing taxonomy of elements and their 
evolution. The evolution of the Internet is yet another 
paradigmatic example. In the beginning, there were only 
a couple of computers connected to each other, but then 
the network grew [37j splitting into a collection of in- 
dependently administered networks, called autonomous 
systems (ASs), whose number and diversity have been 
growing fast [3S]. Currently, ASs can be classified based 
on their geographic position and coverage, size, number 
and type of customers, business role, and many other 
factors [551155]. 

The general observation that the metric structure of 
node similarity distances is hyperbolic follows from the 
mathematical fact illustrated in Fig. [2] We assume there 
that a point in represents an abstract node attribute 
or characteristic, while a Euclidean disk in M.'^ represents 
a collection of all the attributes for a given node in the 
network. The network itself is not shown. Instead we 
visualize a hidden hierarchy arising from the overlapping 
disks. The more two disks overlap, the more similar the 
sets of characteristics of the two corresponding nodes, 
that is, the more similar the nodes themselves. But the 




FIG. 2: Mapping between disks in the Euclidean plane 
and points in the Poincare half-space model of the three- 
dimensional hyperbolic space H'^ |21) . The x, j/-coordinates 
of disks in are the x, y-coordinates of the corresponding 
points in H^. The z-coordinates of these points in are 
the radii of the corresponding disks. This mapping repre- 
sents the tree-like hierarchy among the disks. Two points in 
are connected by a solid link if one of the corresponding 
disks is the minimum-size disk that fully contains the other 
disk. This hierarchy is not perfect; thus, the tree structure is 
approximate. The darkest disk in the middle partially over- 
laps with three other disks at different levels of the hierarchy. 
Two points in are connected by a dashed link if the cor- 
responding disks partially overlap. These links add cycles to 
the tree. The shown structure is thus not strictly a tree, but 
it is hyperbolic [27j . 



mapping between disks and nodes in IHI'^ in Fig. [2] is 
such that the more the two disks overlap, the hyperbol- 
ically closer are the corresponding two nodes. Formally, 
if the ratio of the disks' radii r, r' is bounded by a con- 
stant C, 1/C ^ r/r' ^ C, and the Euclidean distance 
between their centers is bounded by Cr, then the hy- 
perbolic distance between the corresponding nodes in 
is bounded by some constant C, which depends only 
on C, and not on the disk radii or center locations |27) . 
The converse is also true. Therefore, the distances be- 
tween nodes based on similarity of their attributes can 
be mapped to distances in a hyperbolic space, assuming 
that node attributes possess some metric structure (M^ 
in the above example) in the first place. 



IV. HYPERBOLIC GEOMETRY YIELDS 
HETEROGENEOUS TOPOLOGY 

We now put the intuitive considerations in the previous 
section to qualitative grounds. We want to see what net- 
work topologies emerge in the simplest possible settings 
involving hyperbolic geometry. 
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FIG. 3: The expected degree of a node at point X located 
at distance r from the origin O is proportional to the area of 
the dark-shaded intersection S{r) of the two disks of radius 
R. The first disk, centered at O, contains all the nodes, dis- 
tributed within it with a uniform density. The second disk, 
centered at X, is defined by the connection probability p{x), 
which is either 1 or depending on whether the distance x 
from X is less or greater than R. The node at X is con- 
nected to all the nodes lying in the dark-shaded intersection 
area S{r). 



A. Uniform node density at curvature K — —I 

Since the one-dimensional hyperbolic space does 
not exist, the simplest hyperbolic space is the hyperbolic 
plane discussed in Section |llj The simplest way to 
place ^ 1 nodes on the hyperbolic plane is to dis- 
tribute them uniformly over a disk of radius i? ^ 1, 
where R abstracts the depth of the hidden tree-like hier- 
archy. We will see below that i? is a growing function of 



N, reflecting the intuition in Section III that the network 
hierarchy deepens with network growth. The hyperbol- 
ically uniform node density implies that we assign the 
angular coordinates 9 G [0, 2tt] to nodes with the uniform 
density p{6) = l/(27r), while according to Eqs. (3j4| with 
C = 1, the density for the radial coordinate r e |0, R] is 
exponential 



To form a network, we need to connect each pair of nodes 
with some probability, which can depend only on hyper- 
bolic distances x between nodes. The simplest connection 
probability function is 



p{x) = e{R-x), 



(8) 



p{r) 



sinhr 
cosh R — 1 



(7) 



where 8(x) is the Heaviside step function. We will jus- 
tify and relax this choice in Section |VI| This connection 
probability means that we connect a pair of nodes by a 
link only if the hyperbolic distance ^ between them is 
X i^R. 

The network is now formed, and we can analyze its 
topological properties. We are first interested in the 
most basic one, the degree distribution P{k), to com- 
pute which we have to calculate the average degree k{r) 
of nodes located at distance r from the origin. Since 
the node density is uniform, k{r) is proportional to the 
area A(r) of the intersection S{r) of the two disks shown 
in Fig. m Specifically, fc(r) = SA{r) with node density 
S = A^/l27r(coshi? — 1)]. The area element d A in polar 
coordinates {y,0) is dA — sinhy dydO, cf. Eqs. (3|4) with 
C = 1. Therefore, the intersection area A{r) = Jj^^^j dA 
is given by the following integration illustrated in Fig. [3] 

A{r) = 2 / sinhy dy I d9 + 2 sinhy dy / d9 

Jo Jo J R-r Jo 

= 27r[cosh(i?- r) - 1] + 2 / 6'j^sinhydy, (9) 

Jr-t 



where 9y € [0, tt] is given by the hyperbolic law of 
cosmes ^ for the triangle AOXY in Fig.|3] 

cosh R = cosh r cosh y — sinh r sinh y cos 9y. (10) 



Solving the last equation for 9y and substituting the re- 
sult into Q yields the exact expression for A{r) and con- 
sequently for the average degree fc(r) 



, N , , s , / tanh(r/2) 

k(r) = — ; -<27r(coshi?-l)-2coshi? arcsin 

^' 27rcoshi?-l 1 ^ ' \ tanhi? 



arctan 



coshi?sinh(r/2) 



-|- arctan — ^ 



(coshi? 4- cosh r)\/ cosh 2R — cosh i 



•\/2(sinh^ R — coshi? — coshr) sinh(r/2) 



arctan 



^sinh(i? + r/2) sinh(i? - r/2) 
(cosh R — cosh r ) V cosh 2R — cosh r 
V^(sinh^ R + cosh R - cosh r) sinh(r /2) 



(11) 



which perfectly matches simulations in Fig. [4j For large 
R this terse exact expression is closely approximated by 



fc(r) = N 



'-Ne~'-'^, (12) 



where the last approximation holds for large r. 
The average degree in the network is then 



k = 



p{r)k{r)dr « -Ne'^'^, 



(13) 
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FIG. 4: (Color online) Average degree at distance r from the 
origin for a network with A'^ = 10000 and R — 16.55. 



from which we conclude that if we want to generate an 
A^-node network with a target average degree k we have 
to select the disk radius R = 2 ln[8Af/(7rfc)]. We see that 
R scales with iV as i? ~ IniV, i.e., the same way as the 
depth of a balanced tree with its size. We also observe 
that by fixing 



N 



(14) 



we gain control over the average degree in a n etw ork via 
parameter v — irk/S, using which we rewrite (12) as 



fc(r) 



'_^{R-r)/2 _ g-r/2_ 



(15) 



To finish computing the degree distribution P{k) we 
treat the radial coordinate r as a hidden variable in the 
terminology of |3D], yielding P{k) = g{k\r)p{r) dr, 
where the propagator g{k\r) is the conditional proba- 
bility that a node with hidden variable r has degree k. 
For sparse networks this propagator is Poissonian [40] . 
g{k\r) = e-'^^^^kirf /k\, using which we finally obtain 



P{k) = 2 



r(fc- 2,fc/2) 

fc! 



„-3 



(16) 



That is, the node degree distribution is a power law. 

This result is remarkable as we have done nothing to 
enforce this power law. Network heterogeneity has natu- 
rally emerged as a direct consequence of the basic proper- 
ties of hyperbolic geometry underlying the network. In- 
deed, the observed power law is a combination of two 
exponentials HTl. n ode density pir) in ([7| and average 
degree k{r) in ( |15| ), both reflecting the exponential ex- 
pansion of space in hyperbolic geometry discussed in Sec- 
tion HH 



B. Quasi-uniform node density at arbitrary 
negative curvature 

We next relax two constraints in the model. The first 
constraint is that the node density is exactly uniform. 
We let it be quasi- uniform, 



p{r) 



sinh ( 



cosh aR — 1 



(17) 



that is, exponential with exponent a > 0. In terms of 
the analogy with trees in Sections II|III this relaxation is 
equivalent to assuming that the hidden tree-like hierarchy 
has the average branching factor 6 = e". Second, we let 
the curvature of the hyperbolic space be any K = —Q"^ 
with C > 0. The node density is exactly uniform now 
only \i a — C,. 

The exact expression for the average degree fc(r) of 
nodes at distance r from the origin is the same as before, 



k{r) 



= N 




p{y) dydO 



p{v) dy 



(18) 



R-r 



p{y)Gy dy 



but we cannot compute it exactly to yield an answer anal- 



ogous to (11). However, approximations are easy. The 
main approximation deals with the angle Oj. in Fig. |3] 
Instead of (10), we now have according to (js) 



cosh C,R = cosh Cr cosh C^y — sinh C^r sinh C^y cos Qy , (19) 

which for large i?, r, and y yields Qy ~ 2e^'^^^''^^^/^. 
Substituting this dy in the integral for fc(r) (18), using 
there the approximate expression for p(y) in p7), and 
introducing notation ^ = {a/Q)/{a/Q — 1/2), we obtain 



k{r) = N l-S^e 



(20) 



r Ar(2^/7r)e-«'-/2 if a > C/2, 

< 7V(l4-C^/7r)e-f''/2 if a -> C/2, (21) 

[iV(l - 2C/7r)e-"'' if a < C/2. 



The average degree k in the whole network is now 



k = -i'^N ■ 

IT 



-aR 



R TT f C 



-(7r-l)^ + (7r-2) -1 
a / 



(22) 
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and its limit at a — )■ C/2 is well defined, 

k > N^R ( 1 + ^r] e-««/2 



(23) 



If a/C > 1/2, we can neglect the second term in (22), 
leading to 



TT 



(24) 



That is, the condition controlling the average degree in 
the network changes from ( 14 ) to 



(25) 



where the control parameter v = 7rfc/(2^^). This control 
is the less accurate, the closer the a to Indeed, as 
a approaches C/2, the relative contribution to the total 
average degree coming from the second term in (22 1 in- 



creases. In particular, if a/C, — 1/2, then ^ is undefined, 
meaning that v can no longer be 7rfc/(2^^). If instead of 
solving Eq. (23) to find radius R for given N and k, we 
fix R according to ( |25[ ) with some = fo, then similar 
to [12] , the average degree in ( 23 ) will grow polylogarith- 
mically with the networks size. 



= t'o In 



N 



TT l/Q 



(26) 



If we neglect the second terms in ( 22|20 ) at a/C > 1/2, 
then using (25), we rewrite (20) as 



k{r) — —e'""' ' "~ ^ e 



-Cr/2 



(27) 



That is, somewhat surprisingly, the scaling of the aver- 
age degree k{r) with radius r does not depend on the 
exponent a > C/2 of the node density. Proceeding as in 
Section IV A the degree distribution P{k) for a > Q/2 is 
then 



2a/C 



r(fc~2a/C,fc/C) ,-(2a/C+l) 

k\ " 

(28) 

For arbitrary values of a/C > the degree distribution 
scales as 



P{k) 



with 7 = 




(29) 



We observe that the node density exponent a and 
the space curvature C affect the heterogeneity of net- 
work topology, parameterized by 7, only via their ra- 
tio a/C. This result is intuitively expected in view of 
the analogy to trees discussed in Sections |II|III[ since a 
tree with branching factor 6 = e" is metrically equivalent 
to the two-dimensional hyperbolic space with curvature 
K — —a^ . In other words, the branching factor of a tree 



and the curvature of a hyperbolic space are two differ- 
ent measures of the same metric property — how fast the 
space expands. Result (29 1 states then that the topol- 



ogy of networks built on top of these metric structures 
depends only on the appropriate normalization, a/C, be- 
tween the two measures. 

The model described so far has thus only two pa- 
rameters, a/C ^ 1/2 and > 0, controlling the degree 
distribution shape and average degree. The model pro- 
duces scale-free networks with any power-law degree dis- 
tribution exponent 7 = 2a/C -1-1^2. The uniform node 
density in the hyperbolic space corresponds to a = Ci 
and results in 7 = 3, i.e., the same exponent as in 
the original preferential attachment model [43]. Since 
i = (a/C)/(a/C - 1/2) = (7 - l)/(7 - 2), ^the average 
degree of nodes at distance r from the origin (27), and 
the total average degree in the network k = 2v^^-k are 



k{r) 



■7 ■ 



1 



.7-2, 

A sample network is visualized in Fig. [5] 



(30) 
(31) 



V. HETEROGENEOUS TOPOLOGY IMPLIES 
HYPERBOLIC GEOMETRY 

In the previous section, we have shown that networks 
constructed over hyperbolic spaces naturally possess het- 
erogeneous scale-free degree distributions. In this sec- 
tion we show the converse. Assuming that a scale-free 
network has some metric structure underneath, we show 
that metric distances can be naturally rescaled such that 
the resulting metric space is hyperbolic. 

To accomplish this task we use the S"'^ model from [12] 
where the underlying metric structure is abstracted by 
the simplest possible compact metric space, circle S^. 
This model generates networks as follows. First, N nodes 
are placed, uniformly distributed, on a circle of radius 
N / (27r) so that the node density on the circle is fixed 
to 1. Then each node is assigned its expected degree, 
which is a random variable k drawn from the continuous 
power-law distribution 



p{k) 



(7-1)^" 



K > Kq, 



(32) 



where 7 > 2 is the target degree distribution exponent, 
and Kq is the minimum expected degree. Finally, each 
node pair with expected degrees (k, k') and angular co- 
ordinates {9,0') located at distance d — NA6/{2tt) over 
the circle {A9 = tt — [tt — |6' — 9'\\) is connected with 
probability p{x) i which can be any integrable function of 



X 



IlKK' 



(33) 



where /i > is the parameter controlling the average de- 
gree in the network. This form of the argument of the 
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observe that 




FIG. 5: (Color online) A modeled network with N = 740 
nodes, power-law exponent 7 = 2.2, and average degree k = 5 
embedded in the hyperbolic disk of curvature K = —1 and 
radius R = 15.5 centered at the origin shown by the cross. 
For visualization purposes, we use the native hyperbolic space 
representation Therefore, the shown network occupies a 
small part of the whole hyperbolic plane in Fig.[l] The shaded 
areas show two hyperbolic disks of radius R centered at the 
circled nodes located at distances r — 10.6 (upper node) and 
r = 5.0 (lower node) from the origin. The shapes of these 
disks are defined by ([5| with = 1, and according to the 
model, the circled nodes are connected to all the nodes lying 
within their disks, as indicated by the thick links. In par- 
ticular, the two circled nodes lie within each other's disks. 
The peculiar shape of these disks shows that the hyperbolic 
distance between any two points other than the origin is not 
equal to the Euclidean distance between them. In particular, 
the farther away from the origin are the two nodes, located at 
the same Euclidean distance in the tangential direction, the 
longer is the hyperbolic distance between them, which ex- 
plains why peripheral nodes are not connected to each other, 
and why a majority of links appear radially oriented. 



connection probability function is the only requirement 
to ensure that the average degree k^n) of nodes with ex- 
pected degree k in the constructed network is indeed pro- 
portional to K, specifically k{K)/k = k/k, where k is the 
average degree in the network as before, and 



K = Kp{K)dK 



= Kq 



7-1 
7-2' 



(34) 



Due to this proportionality, the degree distribution in the 
network is indeed power-law distributed with exponent 7. 
To see that condition (33) ensures k{K)/k = k/R, set 

P{x)dx, and 



k{K) 



p{K')p{x)dK'de' 



N 



00 /•N/{2fJ.KK'] 

I ,J \J.J 



Pix)dx 



2fllKK. 



Since 



k — k{K,)p{K)dK — 2/i/K , 



(35) 



(36) 



we conclude that k{K) = nk/R, and confirm that p 
controls the average degree in the network. We also 
note that kq is a dumb parameter, which can be set to 
kq = fc(7 — 2)/(7 — 1) leading to A:(k) — n. 

We now establish the equivalence between this 
model and the model described in the previous sec- 
tion. To do so, we need to find a change of variables 
from K, expected degree of a node, to r, its radial coor- 
dinate on a disk of radius R, such that if variable k is 



power-law distributed according to (32 1, then after this 



K-to-r change of variables, variable r is exponentially dis- 
tributed according to (17). The change of variables that 



accomplishes this task is 



KqC 



C(i?-r)/2 



(37) 



where C > is a parameter defining a in (17 1 after this 
change of variables. The resulting value of a is a = 
C(7 — l)/2, which is the same relationship among a, C, 



and 7 as in ( 29 1 . In other words, after the K-to-r mapping 
above, the nodes get distributed on the disk as in the 
model, suggesting that parameter C is actually the space 
curvature. 

To check if it is indeed the case, and if the two models 
are indeed equivalent, we have to verify that the pairs 
of nodes connected or disconnected in the §^ model with 
expected degree n mapped to radial coordinate r corre- 
spond to, respectively, connected or disconnected nodes 
in the native model. That is, we have to demonstrate 
that the connection probabilities in the two models are 
consistent, p{x) — p{x)- To show this we first fix the 
disk radius R to its value in the model (25), and then 
observe that if we set 



yielding k — vl- 



7- 



1 



7-2 



(38) 



then the change of variables ( 37 ) maps the argument x 
of the connection probability in the model (33) to 



without loss of generality, let / 



X = ef(--«)/2, (39) 

where x is equal to the second approximation of the hy- 
perbolic distance in (|6]). Therefore, the connection prob- 
ability p{x) in the model is approximately equal to the 
connection probability p (^e^'-^"^'/^) in the §^ model. In 
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particular, the step function connection probability ([8]) 
in the H'^ model corresponds to 



p(x) -e(i-x) 



(40) 



2)1: 



in the model. The integral / of this connection prob- 
ability is obviously 1, so that the fc vs /i relationship (36) 
in the model becomes k — 2^ [ko(7 — l)/(7 
which is consistent with the condition v — irfit 
given the k.vs.v relationship in the model (31). 
the final consistency check, we observe that the substitu- 
tion of the K-to-r mapping ( |37[ ) into the proportionality 
A:(k) = Kk/Rin the §^ model yields (30 1 in the model. 



(38) 



That is, the average degrees k{r) of nodes with radial co- 
ordinate r in the and models are the same. 

The two models are thus equivalent, and with the 
appropriate choice of parameters, generate statistically 
the same ensembles of networks, which one can con- 
firm in simulations. In this section the network metric 
structure has been modeled the simplest way, by cir- 
cle = 911^, which by no means is the only possi- 
bility for the hyperbolic space boundary dX, see Sec- 
tion [nj Therefore, the established equivalence between 
the S^and models suggests that as soon as a hetero- 
geneous network has some metric structure induced by 
distances d on dX, this metric structure can be rescaled 
by node degrees k to b ecome hyperbolic, using appro- 
priate modifications of ( 33|39 ). The heterogeneous de- 
gree distribution effectively adds an additional dimen- 
sion to dX (the radial dimension in the = 9EI^ case), 
such that the resulting space X (H^ in the considered 
case) is hyperbolic, a mechanism conceptually similar to 
how time in special relativity, or gravity in [SHlOj makes 
the higher-dimensional (time)space hyperbolic. In other 
words, hyperbolic geometry naturally emerges from net- 
work heterogeneity, the same way as network heterogene- 
ity emerges from hyperbolic geometry in the previous 
section. 



VI. HYPERBOLIC GEOMETRY VERSUS 
STATISTICAL MECHANICS 

In this section we relax the final constraint in the model 
that the connection probability is a step function, and 
provide a statistical mechanics interpretation of the re- 
sulting network ensemble. 

Since p{x) can be any integrable function in the §^ 
version of the model, p{x) can be any function in the 
version. Given this freedom, we consider the following 
family of connection probability functions, 



1 



1 



Pix), (41) 



parameterized by /3 > 0. The p{x) function is integrable 
for any /? > 1, 



(42) 



^ ^ Jo ^^^^'^^^ (^^"^i 



However, we will not restrict /? > 1, and will also consider 

/3e[0,l). ^ 

The main motivation for the connection probability 
choice (41) is that it casts the ensemble of graphs in 
the model to exponential random graphs [TSHTTj. Ex- 



ponential random graphs are maximally random graphs 
subjected to specific constraints, each constraint associ- 
ated with an auxiliary field or Lagrangian multiplier in 
the standard entropy maximization approach, commonly 
used in statistical mechanics. Each graph G in the en- 
semble has probability weig ht P(G) = e-"^^'^/Z, where 
H(G) is the graph Hamiltonian, and Z = e~^^'^^ 
is the partition function. For example, the ensemble 
of graphs in the configuration model, i.e., graphs with 
a given degree sequence {fc^}, is defined by Hamilto- 
nian H{G) = Yji^ih = Y.ij^iO'ij = Y^i<ji^i +^3)0'ij^ 

where uji are the auxiliary fields coupled to nodes i, and 
{ttij} is G"s adjacency matrix. A natural generaliza- 
tion of this ensemble [13] is given by the Hamiltonian 
H{G) — J2i<j ^ijO-ij which the auxiliary fields are 
coupled not to nodes i but to links ij. The partition 
function is then 



(43) 



and the probability of link existence between nodes i and 
J is given by |14j 



Prj 



d\nZ 



1 



e'^-i + 1 



(44) 



The connection probability ( 41 ) thus interprets the aux- 



iliary fields uJij in this ensemble as a linear function of 
hyperbolic distances xij between nodes in the ensemble 
of graphs generated by our model. 



R), 



(45) 



which makes the two ensembles identical. 



The connection probability ( 41 1 is nothing but the 
Fermi-Dirac distribution. It appears because we allow 
only one link between a pair of nodes. If we allowed mul- 
tiple links, or if we considered weighted networks, the 
resulting link statistics would be Bose-Einstein [TH [T5] . 
Hyperbolic distances a; in (41 1 can now be interpreted as 



energies of fermionic links, whereas hyperbolic disk ra- 
dius R is the chemical potential, 2/C is the Boltzmann 
constant, and /? = 1/T is the inverse temperature. The 
ensemble is grand canonical with the number of particles 
or links M fixed on average. The standard definition of 
the chemical potential is then 



M = 



2R 



g{x)p{x) dx, 



(46) 



where g{x) is the degeneracy of energy level x. In our 
case, g{x) is the probability that two nodes are located 
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at distance x from each other. We can compute this 
probabihty to yield 



(47) 



where a, 6 are some constants, and 7 = 2a/(^ + 1. Sub- 
stituting this g{x) in definition (461, using AI = kN/2 



there, and keeping the leading terms, we get 

2 -~PCR/2 



^V7-2y (l-/3)c 



(48) 



where c is another constant which we determine in the 
next section. If /3 > 1, we neglect the second term above, 
and observe that the standard definition of the chemical 



potential in statistical mechanics ( 46 1 yields the same 



result as ( 25 ) , obtained using purely geometric argu- 



ments. The same observation ap plie s for the parameter 
v = N e^''^/^ that we get from (|48 h it is the same as 



in (361 with /i = vK-kk^) and R in (34|, or as in (31 1 if 
temperature T = 0, so that 1=1. 

At T = the system is in the ground, most degener- 
ate state, and all M links occupy the lowest energy levels 
until all of them are filled. In this ground state, Fermi dis- 
tribution (41) converges to the step function ([8|, which 
a posteriori justifies our choice there. At higher tem- 
peratures the fermionic particles start populating higher 
energy states, and at T = 1 we have a phase transition 
caused by the divergence of p{x) leading to a disconti- 
nuity of the partition function (43). This discontinuity 



is due to the discontinuity of the chemical potential R. 
We see from ( 48|42 1 that R diverges as ~ — ln(/? — 1) at 
/3^-l+. If/3<1, then the second term in (48) is the 



leading term, and instead of ( 25 ) we have 



iV = fc(l-/3)ce^^«/2^ 



(49) 



so that at /3 1-, the chemical potential R diverges as 
~ — ln(l — /3). We investigate what effect this phase tran- 
sition has on network topology in the next two sections. 



B. 



< 1 



In this hot regime, the connection probability p{x) 
diverges, and we have to renormalize its integral / = 
J p{x)dx- Specifically, instead of integrating to infin- 
ity as in ( |35[ ), we have to explicitly cut off the inte- 
gration at the maximum value of Xmax = N/{2nKK'). 
The exact value of Jq"^'''' pix)dx with p{x) in (41) is 
2Hi{l,f3-^; l-h/3"^ -Xi'?iax)Xmax, where 2H1 is the Gauss 
hypergeometric function. The leading term of this prod- 
uct for large Xmax and /3 G [0, 1) is Xmax/(1 - substi- 
tuting which int o th e expression for the average degree 
in the §^ model (35) we get 



k{K) 



{k)=k = {2flf{K^f 



7V1-/3 



1-/3 
n^^ p[n)dK, = Kq — 



(50) 
(51) 
(52) 



where 7 is the input value of the 7-parameter in the S'^ 
model, i.e., the distribution of the hidden variable k is 



p{n) 



(7 — 1)k We introduce a new notation 



for this parameter to differentiate it from the value of 
power-law exponent 7 in generated networks, which is 
different from 7 in this hot regime. Indeed, since the 
average degree fc(K) of nodes with hidden variable n is 
no longer proportional to k but to (50), the degree 
distribution in the modeled networks is 



P{k)-^k-^, with7 = (7- l)r-H 1. 



(53) 



The mapping to the model is achieved via the 



same change of variables (37), and by requiring that 
= eC(a;-i?)/2^ Performing this change of variables, and 
noticing that (7 - l)/(7 - ^ - 1) = (7 - l)/(7 - 2), we 
obtain the following key relationships in the model: 



VII. 



DEGREE DISTRIBUTION AT NON-ZERO 
TEMPERATURE 

A. /3 > 1 



Since the connection probability p{x) in ( 41 ) is in- 



tegrable in this cold regime, we immediately conclude 
that the degree distribution is the same power law as 
at the zer o te mperature , w hile the average degree is 
k = 2fj,lR^ (36) with / in (42). In view of the equivalence 
between the S and models established in Section [v] 
the power-law exponent 7 > 2 is related to the model 
parameters C > and a > C,/2 via 7 = 2a/( + 1, as at 
T = 0. Th e chem ical potential is i? = (2/C) \n{N/i^) with 
1/ = TTfinl ( |25|38| ). 



a a 
2^T+1, 7 = 2^ + 1, 



7 

k{r) 
k 



j j - ^ ^K{R-r)/2 
7-1 

V /2 



7-1 
7-2 



2\fi j^jl-P 



(54) 
(55) 

(56) 
(57) 



The last two equations fill in the c coefficient in the ex- 



pression for the chemical potential ( 49 ) 



Finally, we note that in the hot regime the admissible 
range of input parameters controlling the degree distri- 
bution exponent 7 is 7 > /3 + 1 (S^) or a > /3C/2 (H^), 
both yielding 7 > 2. 



11 



lO 



















-tAt- 7=2.2; simulations 
^ 7=2.2; theory 
-Q- 7=2.5; simulations 

7=2.5; theory 
HZI- 7=3.0; simulations 
-0- 7=3.0; theory 


7=2.5 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 



FIG. 6: (Color online) Average clustering c as a function of 
temperature T = 1//3 £ [0,1). The simulation results are 
averaged across 100 networks with average degree k = 6 and 
N = 10^ nodes each. The average clustering is calculated 
excluding nodes of degree 1. The theoretical results are ob- 
tained via the numerical integration of c = J^c(K)p(K)dK 



with c(k) given by Eq. (591. The stronger disagreement be- 
tween simulations and theory for smaller values of 7 is due to 
the increasingly pronounced finite-size effects |44| . 



VIII. CLUSTERING AS A FUNCTION OF 
TEMPERATURE 



A. /3 > 1 

In the cold regime, the average clustering c is a de- 
creasing function of temperature, see Fig. |6] Clustering 
is maximized at T = 0, and it gradually, almost linearly, 
decreases to zero at the phase transition point T = 1. 

Unfortunately, c cannot be computed analytically, but 
some estimates for specific values of (3 are possible. The 
average clustering c(k) of nodes with expected degree 
K in the model is the probability that two nodes 
with expected degrees and angular coordinates {k',9') 
and {k",6"), both connected to node with {k,0) (we set 
6 = without loss of generality), are connected to each 
other. Introducing notations for the three rescaled dis- 
tances x' = N9'/{2TrfiKK'), x" = Ne"/{2TT^iKK"), and 
X = NAe/{2TTiJ,K'K"), where AO = \e' - 9"], this proba- 
bility is given by [40] 



c(k) 



N 



dn'dK" p(k')p{k") 



d9'd9"p{x!)p{}^')p{x)- 



(58) 



Changing the integration variables from 9' and 9" to x' 
and x" ill the second integral, extending the integration 
limits to infinity, and using the expression for the average 



k'/k 



-k7k.. 



k|x'/k"-x"/k'|<1 



Ix1<i 



lx"l<i 



k7k 



^k'/k 



FIG. 7; (Color online) The inner integral in (59 1 at the zero 
temperature is the dark shaded area in the center. 



degree in the model ( 35 ) , yield 
1 



c(k) = 



„ , , dK'dK"K'pU')K"p{K") (59) 

{2lRfJJ., 



dx'dx"p{\x' mix" \)P [ « 



X X 



At r = 0, / = 1, while p{x) 6(1 - x)- There- 
fore, the inner integral in the last expression reduces 
to the area of the intersection of the square defined in 
the (x')X") coordinates by {|x'| < 1; \x"\ < Ijj and the 
stripe k\x' / — x'V'^'l < 1; Fig. [7j For small k, the 
stripe is so wide for almost any combination of (k',k") 
that it fully contains the square, whose area is 4, so that 
c{kq) w 1 proving that clustering is maximized at the 
zero temperature. Recall that clustering cannot be 1 for 
all node degrees because of structural constraints [45]. 
For arbitrary values of k, the exact expression for the in- 
tersection area involves cumbersome combinatorial con- 
ditions for the mutual relationship among n, k' , and k", 
which make taking the outer integral in (59 1 problematic. 
However one can check that for large k, c(k) = g{^)Ko/K, 
where 5(7) is a decreasing function of 7. 

For any other values of T g (0, 1), the inner integral 
in (59) can be taken by residues, but the number of poles 



depends on /3 = 1/T. At /3 = 2, for example, the inner 
integral is tt'^k'k" /{kk' + kk" + k'k"), so that for 7 = 3 
we have the exact expression for c{k) 

c{k) = K(){{2k + K())ln{2K + kq) — 2{k + KQ)hi{K + kq) 
+ KolnKo}/K^, (60) 

and c(ko) = ln(27/16) = 0.52, while c{k) = (ln4)Ko/K 
for large n. For other values of 7, one can show that 
c(k) = g(7)Ko/K, where 3(7) is also a decreasing function 
of 7. 

In other words, the degree-dependent clustering c(k) 
decays with k as ~ an effect that was considered as 
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a signature of the hierarchical organization of complex 
networks l46ll47l. 



B. 



/3< 1 



In the hot regime, temperature has no effect on 
clustering, which is always zero for large networks. 
This effect can be confirmed in simulations, and seen 
analytically. Indeed, observe that in view of l\50\ 
52), the ^-to-x change of variables, turning ( |58[ ) 
into ([59| , now yields the pre- factor in the latter equal to 

i^iK)^'^ (1 - /3) / {2^^kP)N^~^)] ^ instead of 1/{2Ik)\ 
This new pre-factor is obviously zero in the thermody- 
namic limit. 



IX. CONNECTION TO THE CONFIGURATION 
MODEL AND CLASSICAL RANDOM GRAPHS 



Since clustering does not depend on temperature in the 
hot regime, while the power-law exponent (54) depends 



on temperature via the ratio T/C, we can let T — ^ oo and 
C — oo, but fix their ratio to be a new parameter rj = 
C/T. With this parameter the key equations ( 54|56|57 ) 
in the H-^ model become 



7 = 2- 



1, 



V 



N = iye'"'/^ (61) 



But since curvature C = oo, the last A0-dependent term 
in the expression for the hyperbolic distance (|6| is zero. 
Since this term reflects the presence of the metric struc- 
ture in the network, its disappearance effectively destroys 
this structure. More formally, the network metric struc- 
ture becomes degenerate, because the hyperbolic dis- 
tance Xij between a pair of nodes 
sum of their radial coordinates, Xi 



and j reduces to the 
= r,; -|- rj , as a result 



of which the auxiliary fields (45) decouple, aj.y = 



where = ri{ri — R/2)/2. Therefore, the probability pij 
of the existence of link ij in (44) depends now only on 
the product of i,j's expected degrees k{ri),k{rj) given 
by (55 1, Pij = [k{ri)k{rj)]/[kN], so that the network en- 



semble becomes the ensemble of networks in the config- 
uration model, i.e., the ensemble of graphs with given 
expected degrees [TS]. 

Alternatively, we can keep both a and ^ finite while 
heating the networks up by increasing T — ^ oo. In this 
case, Eqs. ( 54p7 l converge to 7 — )■ oo, fc(r) — >• fc, fc — >• i/, 
and i? — )• 00, while the Fermi-Dirac connection proba- 
bility (41 1 becomes uniform p{x) p = k/N. That is, 
all nodes get uniformly distributed on the boundary at 
infinity 5EI^, and each pair of nodes is connected with 
the same probability p, independent of their distances. 
We note that the distance between two points i,j G dM^ 
with angular coordinates 6i, 9j is Xij — sm(A6ij /2) |26j — 



TABLE II: Network properties in the model — average de- 
gree k, power-law exponent 7, and average clustering c — and 
the model parameters controlling these properties, with ref- 
erences to the corresponding equations. 



Property 


Cold regime 


Hot regime 










k 


M ( 


361 


( 


B8 


1 


M ( 


511 


v ( 


561 


7 


7 ( 


321 


a/C 


(291 


7 ( 


53 1 


a/C 


(541 


c 


P = l/T (Fig. 


61 






compare with ml in H and with Xi 



in the other 



limiting case, the configuration model. The limiting de- 
gree distribution is Poissonian (7 — )> 00), and the network 
ensemble converges to the ensemble of classical random 
graphs Gn,p with a given average degree k = pN jl9| . 
The network in this case loses not only its metric struc- 
ture, but also its hierarchical heterogeneous organization. 

Here, we finish the description and analysis of our ge- 
ometric model of complex networks. To summarize, the 
model can produce scale-free networks with any average 
degree k, power-law exponent 7 > 2, and average cluster- 
ing c, controlled, respectively, by parameters (7, /i, f3) and 
{a/C,iy,f3) in the and formulations of the model, 
see Table [n] In Fig. [8] we observe a good match between 
the basic topological properties of the real Internet and a 
synthetic network generated by the model with an ap- 
propriate choice of parameters in the cold regime. In the 
hot regime the model subsumes the standard configura- 
tion model and classical random graphs as two different 
limiting cases with degenerate geometric structures. 



X. EFFICIENCY OF GREEDY NAVIGATION IN 
MODELED NETWORKS 

In this section we shift our attention from the analysis 
of the structure of complex networks in our model to the 
analysis of their function. Specifically, we are interested 
in their navigation efficiency. 

One important function that many real networks per- 
form is to transport information or other media. Exam- 
ples include the Internet, brain, or signaling, regulatory, 
and metabolic networks. The information transport in 
these networks is not akin to diffusion. Instead informa- 
tion must be delivered to specific destinations, such as 
specific hosts in the Internet, neuron groups in the brain, 
or genes and proteins in regulatory networks. In the lat- 
ter case, for example, the network reacts to an increased 
concentration of some sugar by expressing not all but 
very specific proteins, the ones responsible for digesting 
this sugar. At the same time the nodes in the network 
are not aware of the global network structure, so that the 
questions we face are if paths to specific destinations in 
the network can be found without such global topology 
knowledge, and how optimal these paths can be. 

The salient feature of our model is that it allows one to 
study the efficiency of such path finding without global 
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FIG. 8: (Color online) The Internet as seen by the CAIDA's Archipelago Measurement Infrastructure [48] vs. a network in the 
]SP model with a = 0.55, C — li ^-nd P = 2. (a) The degree distributions P{k) in both networks are power laws with exponent 
7 = 2.1. The theoretical curve is given by (28l. (b) The average nearest neighbor degrees k„n{k). (c) The degree-dependent 
clustering. The theoretical curve is obtained by a numerical estimate of the outer integral in (591. The inner integral is 
TV^ k' k" /{kk' + ) at /3 = 2. The numerical integration is performed by summation over the node degrees k in the 

modeled network, i.e., J dKp{n) — >■ "^Zk-^i^)^ ^^'^ rnapping k's to fc's via k = kR/k. Random graphs capturing the three 
metrics in (a-c) reproduce also many other important structural properties of the Internet [15] . 



knowledge, because our networks have underlying geom- 
etry which enables greedy forwarding (GF). Since each 
node in the network has its address, i.e., coordinates in 
the underlying hyperbohc space, a node can compute 
the distances between each of its neighbors in the net- 
work, and the destination whose coordinates are writ- 
ten in the information packet, or encoded in the signal. 
GF then accounts to forwarding the information to the 
node's neighbor closest to the destination in the hyper- 
bolic space. Since each node knows only its own address, 
the addresses of its neighbors, and the destination ad- 
dress of the packet, no node has any global knowledge of 
the network structure. 

We report simulation results for two forms of GF, orig- 
malGF (OGF) and modified GF (MGF). The OGF algo- 
rithm drops the packet if the current node is a local min- 
imum, meaning that it does not have any neighbor closer 
to the destination than itself. The MGF algorithm ex- 
cludes the current node from any distance comparisons, 
and finds the neighbor closest to the destination. The 
packet is dropped only if this neighbor is the same as the 
packet's previously visited node. 

These GF processes can be very inefficient. They can 
often get stuck at local minima, or even if they suc- 
ceed reaching the destination, they can travel along paths 
much longer than the optimal shortest paths available in 
the network. Furthermore, even if they are efficient in 
static networks, their efficiencies can quickly deteriorate 
in the presence of network topology dynamics, e.g., they 
can be vulnerable with respect to network damage. 

To estimate the GF efficiency in static networks, we 
compute the following metrics: (i) the percentage of suc- 
cessful paths, Ps, which is the proportion of paths that 
reach their destinations; (ii) the average hop-length h 
of successful paths; and (iii) the average and maximum 
stretch of successful paths. We consider three types of 
stretch. The first stretch is the standard hop stretch 
defined as the ratio between the hop-lengths of greedy 



paths and the corresponding shortest paths in the graph. 
We denote its average and maximum by si and max(si). 
The optimal paths have stretch equal to 1. The other two 
stretches are hyperbolic. They measure the deviation of 
the hyperbolic length, traveled by a packet along either 
the greedy or shortest path, from the hyperbolic distance 
between the source and destination. Formally, let {s,t) 
be a source-destination pair and let s — ho, hi, h-r — t 
be the greedy or shortest path between s and t, and r its 
hop length. Further, let Xi,i = 1...T, be the hyperbolic 
distance between hi and The hyperbolic stretch is 

the ratio '^iXi/xst, where Xst is the hyperbolic distance 
between s and t. For greedy paths, we denote the av- 
erage and maximum of this stretch by S2 and max(s2); 
for shortest paths those are denotes by S3 and max(s3). 
The lower these two stretches, the closer the greedy and 
shortest paths stay to the hyperbolic geodesies, and the 
more congruent we say the network topology is with the 
underlying geometry. 

We first focus on static networks, where the network 
topology does not change, and then emulate the net- 
work topology dynamics by randomly removing one or 
more links from the topology. For each generated net- 
work instance, we extract the giant connected component 
(GCC), and perform GF between 10"' random source- 
destination pairs belonging to the GCC. All the met- 
rics converge after approximately 10'^ source-destination 
pairs, but we evaluate an order of magnitude more com- 
binations for more reliable results. The network size is 
N = 10'*, and the average degree is = 6.5 in all the ex- 
periments, while temperature T — Q only in the following 
two subsections. 



A. Static networks 

Figure [9] shows the results for static networks, aver- 
aged, for each 7, over five network instances. We see 
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FIG. 9: (Color online) Greedy forwarding in static networks. 
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FIG. 10: (Color online) Greedy forwarding in dynamic net- 
works. 



that the success ratio Ps increases and path length h and 
all the stretches decrease as we decrease 7 to 2. Re- 
markably, for 7 = 2.1, e.g., equal to the 7 observed in 
the Internet, OGF and MGF yield Ps = 99.92% and 
Ps = 99.99%, with the OGF's maximum stretch of 1, 
meaning that all greedy paths are shortest paths. Inter- 
estingly, the hyperbolic stretch of shortest paths (53 and 
max(s3)) is slightly worse (larger) than of greedy paths 
(s2 and max(s2)), which allows us to informally say that 
for small 7's, greedy paths are "shorter than shortest" as 
they are shortest hop- wise, but GF tends to select among 
many shortest paths those of least hyperbolic stretch. 

In summary, GF is exceptionally efficient in static net- 
works, especially for the small 7's observed in the vast 
majority of complex networks, including the Internet. 
The GF efficiency is maximized in this case, and the two 
algorithms exhibit almost the best possible performance, 
with reachability reaching almost 100%, and all greedy 
paths being optimal (shortest). 



B. Dynamic networks 

We next look at the GF performance in dynamic net- 
works with link failures. For each 7, we randomly select 
a network instance from above and remove one or more 
random links in it. We consider the following two link- 
failure scenarios. In Scenario 1 we remove a percentage 
Pr, ranging from 0% to 30%, of all links in the network, 
compute the new GCC, and for all source-destination 
pairs remaining in it, we recompute the new success ra- 
tio p"®™ , and the average and maximum stretch s"'^™ and 
max(s"'^'"). In Scenario 2 we provide a finer-grain view 
focusing on paths that used a removed link. We remove 



one link from the network, compute the new GCC, and 
for the source-destination pairs that are still in it, we find 
the percentage of successful paths, only among those 
previously successful paths that traversed the removed 
link. For these still-successful paths, we also compute 
the new average and maximum stretch s\ and max{s\). 
We then repeat the procedure for 1000 random links, and 
report the average values for p[ and s\ , and the maximum 
value for max(s5^). 



Figure [TO] presents the results. We see that for small 
7's, the success ratio p"^"^ remains remarkably high, for 
all meaningful values of pr- For example, MGF on net- 
works with 7 = 2.1 and pr 10%, yields > 99%. 
The simultaneous failure of 10% of the links in a network 
such as the Internet is a rare catastrophe, but even after 
such a catastrophe the success ratio in our synthetic net- 
works is above 99%. The average stretch s"*^™ slightly 
increases as we increase Pr, but remains quite low. We 
do not show max(s"'^'") to avoid clutter. For 7 = 2.1, 
max(s^^'") ^ 2. The percentage p^ of MGF paths that 
used a removed link and that found a by-pass after its 
removal is also remarkably close to 100% for small 7's. 
The average stretch s\ in Scenario 2 also remains low, be- 
low 1.1, and the maximum stretch max(s']^) never exceeds 
1.5. 

In summary, GF is not only efficient in static networks, 
but its efficiency is also robust in the presence of network 
topology dynamics. In particular, for small 7's matching 
those found in real networks such as the Internet, GF 
maintains remarkably high reachability and low stretch, 
even after catastrophic damages to the network. 
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FIG. 11: (Color online) Greedy forwarding as a function 
temperature T. 
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FIG. 12: (Color online) Greedy forwarding in the configura- 
tion model. 



C. Role of clustering 



Next we fix 7 = 2.1, 



mancc as a function of temperature in Fig. 11 



and investigate the GF perfor- 
The pic- 



ture is qualitatively similar to Fig. [9j The GF efficiency 
is the better, the smaller is the temperature, i.e., the 
stronger the clustering, see Fig. [6j At zero temperature 
where clustering is maximized, GF demonstrates the best 
possible performance, as discussed in Section pCA| 



D. Random graphs 



does not admit a simple and efficient solution |50j . 

To test classical random graphs, we assign to nodes 
their angular coordinates uniformly distributed on 
[0,27r], connect each node pair with the same proba- 
bility p = kN, and compute distances according to 
Xij — sm{A9ij /2). Greedy forwarding is extremely inef- 
ficient in this case. The OGF and MGF average success 
ratios Ps are 0.17% and 0.21%. 

In summary, the hierarchical organization (heteroge- 
neous degree distribution) and metric structure (strong 
clustering) in the network are both critically important 
for network navigability. 



Finally, we look at the GF performance in the con- 
figuration model and classical random graphs, which are 
two different degenerate cases with zero clustering in our 
geometric network ensemble, see Section [Tx} 

To test the configuration model, we fix a = 1/2, so 
that 7 = l/?7 + 1, compute distances between nodes i,j 
according to Xij = ri+rj, and show the results in Fig. 12 
We observe that the GF efficiency is poor in this case. 
The success ratio Ps never exceeds 40%, and drops to 
below 10% for large 7's. This poor performance is ex- 
pected. Indeed, since Xij = + rj, GF reduces to fol- 
lowing the node degree gradient. Each node just for- 
wards the packet to its highest-degree neighbor h since 
this neighbor has the smallest radial coordinate r^, thus 
minimizing the distance to the destination. If during this 
process the packet reaches the highest-degree hub in the 
network core, without visiting a node directly connected 
to the destination, then it gets stuck at this hub, because 
no angular coordinates instructing in what direction to 
exit the core are any longer available — a problem, which 



E. Why hierarchical structure and strong 
clustering ensure efficient navigation 

We have seen that more heterogeneous networks 
(smaller 7) with stronger clustering (smaller T) are more 
navigable. Here, we explain why it is the case. 

We first recall that the congruency, measured by the 
hyperbolic stretch, between the network topology and 
hyperbolic geometry is the stronger, the smaller are the 
7 and T, see Figs. |9|11[ To visualize this effect, we draw 
in Fig. 13 [a.) a couple of GF paths and their correspond- 
ing hyperbolic geodesies. We see that the lengths of the 
latter are indeed dominated by the sums of the radial co- 
ordinates of the source and destination, minus some A0- 
dependent corrections ([6|. This domination of the radial 
direction shapes the following hierarchical path pattern 
of the hyperbolic geodesies, as well as of the correspond- 
ing GF paths: (i) zoom-out from the network periphery 
to the core, moving to increasingly higher-degree nodes, 
that is, nodes covering increasingly wider areas by their 
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(a) (b) (c) 

FIG. 13: (Color online) (a) Two greedy paths, which are also shortest paths (si = 1), from the source at the top to two 
destinations are shown by the solid arrows. The dashed curves are the hyperbolic geodesies between the same source and 
destinations. The hyperbolic stretches S2 = S3 of the left and right paths are 1.51 and 1.68. (b) The inner triangular shape 
(green) shows the angular sector 6 that the outer shape (red), which is the hyperbolic disk of radius R centered at the circled 
point located at distance r from the crossed origin, cuts out off the dashed circle of radius r' centered at the origin. The 
expected node degrees at r and r' are k and k'. (c) The circled node is an example of a bridge node. It is connected to all 
nodes in its hyperbolic disk of radius R (the outer shape (red)), including all nodes with expected degrees exceeding a certain 
threshold, or, equivalently, to all nodes with radial coordinates below a certain threshold, shown by the innermost disk (green) 
whose radius is 7? — r, where r is the radial coordinate of the circled node. 



connections, see Fig. [5] (ii) turn in the core to the di- 
rection of the destination; and finally (iii) zoom-in onto 
it, moving to lower-degree nodes. This path pattern is 
exactly the pattern of hierarchical paths in |51| . A path 
is called hierarchical in [51' if it consists of two segments: 
first, a segment of nodes with increasing degrees, and 
then a segment of nodes with decreasing degrees. As 
shown in [STj (see Fig. 2(a) there), the percentage of 
shortest paths that are also hierarchical approaches 100% 
with 7 — > 2. Remarkably, this hieratical path pattern 
also characterizes the policy-compliant paths followed by 
information packets in the Internet |52l 153] . Since the 
GF paths, also the shortest paths in the network, follow 
the shortest geodesic paths in the hyperbolic space, the 
resulting hyperbolic stretch is small. Thanks to strong 
clustering, the network has many partially disjoint paths 
between the same source and destination, which all fol- 
low the same hierarchical pattern. Therefore, even if 
some paths are damaged by link failures, other congruent 
paths remain, and GF can still find them using the same 
hyperbolic geodesic direction, which explains the high ro- 
bustness of network navigability with respect to network 
damage. As clustering weakens, not only the path diver- 
sity in the network decreases, but also the network metric 
structure deteriorates, since the edge existence probabil- 
ity ( 41 1 depends less and less on the hyperbolic distance 
between nodes. In the extreme case of classical random 
graphs, for example, the connection probability does not 
depend on this distance at all. As a result, the congru- 
ency between network topology and underlying geometry 
evaporates. 



Heterogeneity is another key element responsible for 
high navigability. This heterogeneity is nothing but a re- 
flection of the hierarchical, tree-like structure of the un- 
derlying hyperbolic space. Indeed, its hierarchical struc- 
ture manifests itself in the hierarchy of node degrees, 
and in the degree-dependent amount of space that nodes 
cover by their connections. As Fig. [5] shows, nodes of 
higher degrees, closer to the top of the hierarchy, cover 
wider areas with their connections. To quantify, at T = 
the angular sector 0(k, k') that nodes with expected de- 
gree K cover by their connections to nodes with expected 
degree k', see Fig. [TsJ^b), is 9{k,k') = A-n^nn' /N. This 
degree-dependent hierarchy of space coverage makes the 
hierarchical zooming-out / zooming-in path pattern possi- 
ble and successful. 

Finally, the stronger the heterogeneity, the more 
bridges are in the network, where by bridges we mean 
nodes that connect to all nodes with expected degrees 
exceeding a certain threshold, an example is shown in 
Fig. llSFc). This threshold is given by the equation 
9{k, k) = 27r, yielding that a node with expected de- 
gree K is connected to all nodes with expected degrees 
kI > N/{2^k). However such ^'-degree nodes may not 
exist in the network, as the required k' may exceed the 
maximum expected degree Kmax = koN^/^'^~^^ [44l. Re- 
quiring k' < Kinax leads to K > N''^-^'^/'-''-^'^/{2fiKo). 
That is, only such K-degree nodes are expected to be 
bridges. The equation for the expected bridge existence 
is then K < Kmax, yielding N'-''-^^^'^^-^y{2fii4) < 1. 
That is, bridges exist in any sufhciently large network 
with 7 < 3 — the smaller the 7 is, the more bridges and 
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the longer they are — while networks with 7 > 3 have 
no bridges. The role of bridges in the network core is 
straightforward: as soon as GF reaches a bridge, it can 
cross the entire network, in any direction, at one hop |54j . 
Without bridges, GF is doomed to wander along the net- 
work periphery, endangered by getting lost there at any 
hop. The GF success ratio in networks with 7 > 3 dete- 
riorates to zero in the thermodynamic limit |55, . 

XI. CONCLUSION 

We have developed a framework to study the struc- 
ture and function of complex networks in purely geomet- 
ric terms. In this framework, two common properties 
of complex network topologies, strong heterogeneity and 
clustering, turn out to be simple reflections of the basic 
properties of an underlying hyperbolic geometry. Het- 
erogeneity, measured in terms of the power-law degree 
distribution exponent, is a function of the negative cur- 
vature of the hyperbolic space, while clustering reflects 
its metric property. 

Conversely, a heterogeneous network with a metric 
structure has an effective hyperbolic geometry under- 
neath. This finding sheds light on self-similarity in com- 
plex networks }12j . The network renormalization proce- 
dure considered in [H] — throwing out nodes of degrees 
exceeding a certain threshold — is equivalent to contract- 
ing the radius of the hyperbolic disk where all nodes 
reside. This contraction is a homothety along the ra- 
dial direction, which is a symmetry transformation of 
the hyperbolic space, and self-similarity of hyperbolic 
spaces with respect to such honiothetic transformations 
has been formally defined and studied [26] . Self-similarity 
of complex networks thus appears as a reflection of self- 
similarity of hyperbolic geometry, or as the invariance 
with respect to symmetry transformations in the under- 
lying space. 

The developed framework establishes a clear connec- 
tion between statistical mechanics and hyperbolic geom- 
etry of complex networks. The collection of edges in a 
network, for example, can be treated as a system of non- 
interacting fermions whose energies are the hyperbolic 
distances between nodes. This geometric interpretation 
may lead to further developments applying the standard 
tools of statistical mechanics to network analysis. 

The network ensemble in our framework subsumes 
the standard configuration model and classical random 
graphs as two limiting cases with degenerate geomet- 
ric structures. The hyperbolic distance between two 
nodes ^ delicately combines their radial and angular 
coordinates. In the configuration model, the distance 
degenerates to the sum of radial coordinates only, de- 
stroying the network metric structure. In classical ran- 
dom graphs, on the contrary, there is no radial distance 
dependence. The connection probability between nodes 
does not depend on any distances at all. As a result, 
not only the metric structure of a network but also its 



hierarchical heterogeneity gets completely destroyed. 

We have shown that both these properties, strong clus- 
tering and hierarchical heterogeneous organization, are 
critically important for navigability, which is the network 
efficiency with respect to targeted transport processes 
without global knowledge. Such processes are impossible 
without auxiliary metric spaces since global knowledge 
of network topology would be unavoidable in that case. 
The developed framework not only provides a set of tools 
to study these processes, but also explains why and how 
strong clustering and hierarchical network organization 
makes them efficient. 

We have observed that the strongest clustering and 
strongest heterogeneity, often found in real networks, 
lead to optimal navigability. The transport efficiency is 
the best possible in this case, according to all efficiency 
measures. Yet more remarkable is that this efficiency is 
extremely robust with respect to even catastrophic dis- 
turbances and damages to the network structure. 

Complex networks thus appear to have the optimal 
structure to route information or other media through 
their topological fabric. No complicated and artificial 
routing schemes or constructions, impossible in nature 
anyway, turn out to be needed to route information opti- 
mally through a complex network. Its geometric under- 
pinning drastically simplifies the routing function, mak- 
ing efficient the "dumb" strategy of transmitting infor- 
mation in the right hyperbolic direction toward the des- 
tination. 

Does signaling in real networks, such as cell signaling 
pathways or the brain, follow hyperbolic geodesies, and 
if it does then what network perturbations might break 
this signaling, potentially causing (lethal) diseases? To 
answer these questions, one has first to map a real net- 
work to its underlying space, finding the coordinates for 
each node. In our recent work to reduce the routing com- 
plexity in the Internet 156] , we map the Internet to its hy- 
perbolic space using statistical inference methods. These 
methods work well, but require substantial manual inter- 
vention, and do not scale to large networks. An inter- 
esting open problem is thus to find constructive mapping 
methods, e.g., deriving the underlying distances between 
nodes from their similarity measures based on node at- 
tributes and annotations in a given network. 
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